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ABSTRACT 


This  piper  gives  an  overview  of  the  problems  Involved  in  the 
construction  of  a  computer-based  question -answering  system 
designed  to  interact  with  the  user  in  English.  The  system  is 
viewed  as  containing  five  distinct  parts  —  a  parser,  a 
semantic  interpreter,  an  information  storer,  an  information 
retriever,  and  an  English  output  generator.  There  is  a  need 
for  extensive  interaction  among  these  subsystems,  and  between 
the  subsystems  and  the  user.  Examples  are  given  of  the  type 
of  processing  done  cy  each  subsystem,  and  the  nature  of  the 
possible  interac tions.  The  syntactic  analysis  described  is 
based  on  a  Chomsky  type  of  transformational  grammar.  The 
semantic  store  is  characterized  by  a  form  of  the  predicate 
calculus,  with  additional  algorithms  for  computation,  and 
structures  designed  for  fast  access  to  relevant  data. 
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SECTION  I 


INTRODUCTION 

YJith  the  advent  of  time  -shared  computers,  an  interactive 
computing  facility  has  been  placed  at  the.  command  of  large 
numbers  of  people.  However,  the  computer  remains  inacces¬ 
sible  to  many  because  they  cannot  speak  a  language  that  the 
computer  understands.  Development  of  programs  which  "under¬ 
stand"  natural  language  (in  the  sense  described  _below)  will 
allow  interaction  between  man  and  machine  to  take  place, 
for  example,  in  English. 

There  are  a  number  of  projects  throughout  the  country  which 
are  attempting  to  develop  computer  systems  which  will  accept 
English  sentences  typed  into  the  computer,  and  respond  to 
the  user  in  English.  This  article  is  based  on  work  in  pro¬ 
gress  at  Bolt  Beranek  and  Newman  Inc.  on  one  such  system, 
called  SENSE  (SEmantic  Network  Storage  Experiment).  This 
system  is  not  yet  completely  implemented,  and  thus  some  of 
what  follows  should  be  considered  in  the  nature  of  speculation, 
tte  present  here  only  an  overview  of  some  of  the  problems 
(mostly  unsolved)  involved  ir.  development  of  natural  language 
communication  with  computers.  » Je  use  simple  examples  to 
illustrate  these  problems,  and  go  Into  no  detail  on  solutions. 
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SECTION  II 


NATURAL  COMMUNICATION 

Before  proceeding  further,  let  us  warn  the  reader  that 
natural  language  is  not  always  the  most  natural  medium  for 
communication.  A  physicist  describing  the  motion  of  objects 
in  a  gravitational  field  uses  a  differential  equation  because 
it  is  more  natural,  and,  more  important,  more  precise.  An 
architect  feels  a  picture  is  worth  a  thousand  word3.  Both 
use  their  own  languages  (mathematics  and  graphics)  to  inter¬ 
act  with  computers.  Even  in  cases  where  English  would  do, 
when  the  same  wordy  message  must  be  transmitted  again  and 
again,  or  very  rapidly,  people  use  codes  or  push  buttons  to 
"converse"  with  computers. 

\ 

When,  the^i,  is  natural  language  "natural?"  —  in  cases  where 
no  code  ol*  Jargon  has  been  generated,  where  messages  are 
seldom  repeated,  or  where  the  ideas  to  be  transmitted  are 
not  really  precisely  defined.  An  important  aspect  of  the 
communication  between  people  is  that  a  listener,  by  aslcing 
the  right  questions  of  a  speaker,  forces  him  to  define  more 
carefully  the  relationships  he  is  describing.  In  our  efforts 
to  build  computer  systems  to  understand  natural  language,  we 
cannot  ignore  this  important  type  of  interaction.  A  good 
natural -language  system  must  be  active,  not  passive,  and  must 
work  with  the  user  to  achieve  an  understanding. 
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We  have  used  the  word  "understand"  in  connection  with  a  com¬ 
puter  system.  Let  us  provide  an  operational  definition  of 
this  concept.  We  cannot  talk  about  a  computer  "understanding" 
a  sentence  in  vacuo.  The  computer  understands  only  in  the 
context  of  a  body  of  information  and  procedures  which  it  con¬ 
tains.  All  of  the  systems  under  construction  contain  a 
fairly  complex  store  of  information  and  set  of  procedures. 

The  critical  test  of  understanding  is,  can  the  system  answer 
a  question  phrased  in  English  based  on  its  3tored  Information? 
fne  "depth  of  understanding,"  or  the  problea-solving  ability 
of  the  program,  can  be  measured  by  how  "obvious"  the  answer 
is  from  the  data  base;  retrieval  of  a  prestored  fact  is  simpler 
than  a  deduction  or  computation  based  on,  or  implicit  in,  the 
oa  bd  oG  . 

There  are  limitations  to  the  English  that  will  be  accepted 
by  natural -language  interactive  systems  (including  people). 
SENSE  is  no  exception.  These  limitations  fall  broadly  into 
two  classes  —  syntactic  and  semantic.  Syntactic  limitations 
deal  with  the  forms  of  statements  or  questions  the  system  will 
accept.  For  example,  the  system  may  recognize  only  "active 
sentences"  such  as  "John  read  the  book,"  and  not  accept  passive 
forms  such  as  "the  book  was  read  by  John."  Semantic  limita¬ 
tions  refer  to  the  kind  of  things  you  can  talk  about.  The 

3 

STUDENT  program,  for  example,  could  answer  only  questions 
framed  in  the  context  of  algebra  story  problems;  for  example: 
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Mary  was  twice  as  old  as  Ann  was  when  Mary 
was  twice  as  old  as  Ann  is  now.  It  Mary  is 
24,  how  old  is  Ann? 

Within  this  very  limited  context  STUDENT  demonstrated  a 
remarkable  understanding  (i.e.,  question -answering  ability), 
but  was  incapable  of  understanding  things  outside  thi3  se¬ 
mantic  domain.  Similarly,  the  question -answering  system 
currently  under  development  at  Technical  Operations,  Inc., 
Eurlington,  Massachusetts  (personal  communication),  can  answer 
only  questions  relevant  to  its  data  base,  which  is  a  ceded 
form  of  the  airline  guide.  Questions  outside  this  context 
are  rejected  as  not  understandable. 

These  limitations  effect  the  usefulness  of  a  natural  language 
system.  Some  other  critical  questions,  however,  are:  how 
comfortable  is  a  person  confined  within  these  limits  in 
communicating  with  the  system;  and  how  easily  can  the  system 
be  extended  to  include  new  syntactic  structures  and  semantic 
relations?  We  will  discuss  these  questions  after  we  describe 
the  structure  of  the  semantic  store  and  the  form  of  grammar 
of  the  language. 

The  flow  diagram  in  Figure  1  illustrates  the  general  struc¬ 
ture  of  the  "SENSE"  question -answering  system. 

The  solid  lines  indicate  flow  within  the  system  when  the 
system  can  process  an  input  without  further  help  from  the  user. 
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FLOW  DIAGRAM  OF  THE  SENSE  QUESTION  ANWERING  SYSTEM 


The  clotted  lines  represent  queries  back  to  the  user  for  help. 
For  example,  if  the  user  says,  "John  likes  flying  planes," 
the  system  might  inquire  whether  "flying"  should  be  considered 
a  verb  or  an  adjective  in  this  sentence  —  thus  eliminating 
this  syntactic  ambiguity  before  further  processing.  Similarly, 
the  semantic  ambiguity  in  "Parker  is  in  the  pen"  can  be  elim¬ 
inated  by  asking  the  user  if  he  ij  referencing  a  fountain  pen 
or  pig  pen  (unless,  of  course,  Parker  i3  a  convict).  Finally, 
the  retriever  might  need  additional  information  to  determine 
the  answer  to  a  question  such  as  "how  long  does  AA-57  take 
to  get  to  L03  Angeles?"  It  could  ask  the  user  which  starting 
point  of  the  flight,  e.g..  New  York  or  Chicago,  does  he  wish 
to  consider.'  All  of  these  queries  are  directed  to  the  user 
through  a  program  which  converts  the  computer  query  into 
English.  No  one  has  yet  built  a  program  which  transforms 
a  semantic  representation  of  an  "idea"  or  relationship  into 
English  output.  At  present,  we  compromise  by  substituting 
into  prescribed  forms.  At  some  level  this  may  be  all  we  can 
really  do. 
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SECTION  III 


THE  STRUCTURE  OF  THE  SEMANTIC  STORE 

2 

The  basic  comoonents  of  the  semantic  store  are  five  sets  — 
objects,  functions,  x*elations,  propositions,  and  semantic 
deductive  rules.  The  semantic  interpretation  of  an  English 
statement  is  an  assertion  that  a  (set  of)  relation(s)  holds 
among  some  specified  objects.  A  proposition  is  an  Instance 
of  a  relation  with  particular  objects  filled  in  for  the  argu¬ 
ments.  A  proposition  can  be  thought  of  as  a  "belief"  of  the 
system.  Suppose  we  want  to  add  the  English  statement  "John 
flies  the  plane"  to  the  semantic  store.  This  would  be  re¬ 
presented  as  an  instance  of  the  relation  fly  with  John  as 
the  agent  argument  of  the  relation,  and  plane  as  the  object. 

We  italicize  fly ,  John,  and  plane  to  emphasize  the  fact  that 
they  are  not  words,  but  names  for  relations  and  objects  in 
the  data  base. 

In  addition  to  these  "simple  names"  for  objects,  there  are 
"functional  names."  A  function  In  this  system  Is  a  mapping 
from  n-tuples  of  objects  into  an  object.  An  example  of  a 

function  is  one  represented  by  "pilot  jf  _ , "  and  pilot 

of  the  plane  is  a  functional  name  which.  In  the  context  of* 
the  first  statement,  is  another  name  for  the  object  denoted 
by  John .  An  important  (and  difficult)  part  of  the  question- 
answering  process  is  determining  a  simple  name  (or  names,  since 
it  may  not  be  unique)  which  refers  to  an  ooject  denoted  by  a 
functional  name. 
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SECTION  IV 


THE  QUESTION -ANSWERING  PROCESS 

There  are  two  forms  of  questions  that  can  be  asked  of  the 
system.  The  first,  which  we  call  a  closed  question,  asks 
if  a  given  proposition  is  consistent  with  the  data  base. 

The  answer  to  this  type  of  question  may  be  yes,  no,  probably, 
probably  not,  or  I  don ' t  know.  The  answer  yes  occurs  if 
the  system  can  find  this  proposition  in  its  data  store  —  or 
if  it  can  add  it  to  its  store  by  use  of  its  semantic  de¬ 
ductive  rules.  There  are  three  basic  types  of  semantic 
deductive  rules  —  paraphrase  rules,  relational  deductive 
rules,  and  the  metarules  of  the  deductive  system  Itself. 

A  paraphrase  rule  gives  a  paraphrase,  or  first  level  impli¬ 
cation  of  the  use  of  a  word  In  a  sentence,  and  thus  the  use 
of  an  object  or  relation  in  the  semantic  network.  It  con¬ 
tains  the  definition  of  the  words  denoting  objects  and 
relations  in  the  system.  As  an  example,  the  following  is  a 
transliteration  of  part  of  what  we  store  about  the  relation 
fly  (with  the  meaning,  a  pilot  flies  a  plane)  in  the  SENSE 
system: 

1.  (Context  Information).  Fly  may  be  used  in  a  context 
where  there  is  an  agent,  call  it  S,  and  an  object,  call  it  D. 
S  is  a  person,  probably  a  pilot  and  D  Is  probably  an  air¬ 
craft. 
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2.  (Paraphrase  Information).  "S  flies  D"  implies  (can 
be  paraphrased  as)  "S  moves  in  D  through  the  air." 

Thus,  with  this  information  and  the  earlier  statement,  the 
system  can  deduce  the  answer  "yes"  to  the  question  "does  John 
move?"  Also,  with  this  information,  this  system  will  answer 
"probably"  to  the  question  "is  John  a  pilot?",  using  the 
context  information  provided.  It  will  also  use  the  context 
information  and  the  fact  that  an  airplane  is  not  a  person  to 
answer  "no"  to  the  question  "does  an  airplane  fly  John?" 

The  paraphrase  rules  usually  deal  with  only  one  relation. 

The  relational  deductive  rules  provide  a  way  of  using  several 
propositions  to  obtain  a  single  proposition.  A  typical  ex¬ 
ample  of  a  relational  deductive  rule  Is  the  one  which  defines 
the  transitivity  of  the  relation  in_,  meaning  inside  of.  It 
states  that  "A  is  in  B  and  B  is  in  C  implies  A  is  in_  C"  where 
A,  B,  and  C  can  be  any  objects. 

Implicit  in  our  use  of  these  two  types  of  rules  have  been  the 
metarules  of  the  deductive  system:  the  rule  for  substitution 
and  the  rule  for  detachment.  These  can  be  summarized  for  our 
system  by  the  following: 

1.  If  there  is  a  rule  implies  Rg,  with  variables, 
then  Implies  P2  is  a  true  statement,  provided  P1  and  P^, 
are  derived  by  substituting  any  consistent  set  of  objects 
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for  variables  in  and  Rg. 

2.  If  is  a  proposition  in  the  semantic  store  (i.e., 
a  belief  of  the  system)  and  implies  Pg  is  in  the  system 
or  can  be  put  in  the  system  by  substitution  (rule  1),  then 
Pg  can  be  added  to  the  semantic  store.  These  two  rules  were 
both  used  in  the  example  cited  above. 

A  critical  oro'clem  in  answering  questions  with  such  a  deduc¬ 
tive  system  is  identifying  the  relevant  propositions  and  re¬ 
lations  to  make  the  deductions  pertaining  to  the  question. 

An  exhaustive  search  through  a  large  data  base  would  be  im¬ 
practical,  to  say  the  least.  For  limited  data  bases,  well 
constructed  indexes  helo  cut  down  the  search.  For  general 
semantic  stores,  the  problem  is  more  difficult,  and  no  really 
good  solution  has  yet  been  found.  The  search  problem  Is  not 

unique  t;  semantic  deductive  systems,  but  Is. ubiquitous  throug 

8 

most  problems  In  artificial  intelligence.  In  the  SENSE 
system,  we  will  have  links  to  propositions  from  both  a  token 
of  che  relation  and  from  some  of  the  objects  used  as  argu¬ 
ments  In  the  proposition.  The  choice  of  which  objects  to 
link  to  the  proposition  is  dependent  on  the  context,  and  the 
relative  importance  of  different  objects  and  relations.  This 
measure  of  importance  represents  a  "point  of  view"  for  the 
question -answering  system.  Nith  a  point  of  view,  answering 

i 

some  questions  will  be  easier  than  answering  others  —  with 
n~ne,  all  questions  will  be  equally  difficult. 
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The  problem  of  efficient  search  comes  up  again  in  answering 
"open  questions,"  ones  which  ask  for  the  name  of  an  object 
which  satisfies  a  given  relationship.  An  example  of  an  open 
question  is  "what  city  does  Flight  AA-57  go  to?"  Of  course, 
the  answer  may  not  be  unique,  and  we  would  expect  a  question¬ 
answering  system  to  give  a  list  of  names  of  all  the  objects 
which  satisfy  the  specified  relationship.  Thus  if  AA-57 
starts  from  Doston,  and  stops  in  Chicago  on  the  way  to  Los 
Angeles,  the  question-answering  system  should  answer  "AA-57 
goes  to  both  Chicago  and  Los  Angeles." 

An  alternative  form  of  an  open  question  ask3  for  a  simple 
name  of  objects  denoted  by  a  given  functional  name.  For 

example,  if  "destination  of  _ "  were  a  functional  form  in 

the  system,  then  "What  is  the  destination  of  Flight  AA-57?" 
is  an  alternative  form  of  the  open  question  stated  earlier. 
Associated  with  every  function  in  the  system  must  be  a  pro¬ 
gram  that  can  compute  the  value(s)  of  the  function  given  its 
arguments.  T'ere  is  an  obvious  duality  between  functions  and 
relations.  A  question  can  be  interpreted  as  a  request  for  a 
computation  of  the  value  of  a  function,  or  a  request  for  a 
search  for  objects  wnlch  satisfy  a  specified  relation.  This 
will  depend  on  whether  there  is  a  program  available  in  the 
system  to  compute  the  value  of  the  function,  and  if  it  exists 
its  efficiency  relative  to  a  search  for  the  answer. 
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SECTION  V 


SYNTACTIC  ANALYSIS 

Many  programs  have  been  developed  to  perform  syntactic 
analysis  of  English  on  the  computer.^  The  question  we  will 
address  here  is  how  such  an  analysis  can  be  used  in  the 
question -answering  systems.  The  first  problem  which  can  be 
attacked  with  the  aid  of  a  syntactic  analysis  13  the  deter¬ 
mination  of  the  arguments  of  a  relation.  From  the  parsing 
of  "the  dog  bit  the  man,"  we  can  determine  that  the  agent 
of  the  relation  denoted  by  bit  is  "the  dog,"  and  the  object 
"the  man."  Thus,  we  can  determine  the  difference  between 
this  sentence  and  "the  man  bit  the  dog."  The  subject  of  the 
sentence  is  the  agent  argument  of  the  relation  in  each  case. 

The  problem  becomes  more  difficult  if  we  consider  the  sen¬ 
tence  "the  man  was  bitten  by  the  dog."  Although  this  is  a 
paraphrase  of  the  first  sentence,  the  subject  is  not  the 

agent  of  the  relation  bit.  In  line  with  the  transformational 

*  R 

theory  of  grammar  as  developed  by  Chomsky, J  and  others,  we 
consider  the  obvious  parsing  of  this  passive  sentence  only 
a  "surface  structure."  According  to  this  syntactic  theory, 
the  surface  structure  of  a  sentence  is  derived  from  a  deep 


*  Although  we  are  using  a  transformational  theory  of  grammar, 
our  semantic  analysis  is  very  different  than  that  proposed  by 
Foder  and  Katz6  for  obtaining  semantic  readings  from  a  trans¬ 
formational  analysis  of  a  sentence. 


-12- 


The  passive  and 


structure  by  a  series  of  transformations, 
and  the  active  forms  of  a  sentence  have  the  same  deep 
structure,  and  differ  on  the  surface  because  the  optional, 
and  meaning  preserving,  passive  transformation  was  applied 
to  yield  the  former.  It  is  this  deep  structure,  then,  which 
allows  us  to  determine  the  relationship  of  arguments  to  a 
relation  in  a  sentence. 

Transformational  syntactic  analysis  helps  in  resolving  another 
problem.  Consider  the  sentence  "the  man  who  wrdte  the  pro¬ 
gram  debugged  it."  The  man  referred  to  in  this  sentence  is 
an  argument  in  two  prepositions  "man  wrote  program"  and  "man 
debugged  program."  The  deep  structure  of  this  sentence  shows 
both  embedded  sentences  in  this  explicit  form,  and  shows  the 
relationship  between  the  objects  named  in  each.  Transforma¬ 
tions  carry  this  deep  structure  to  the  surface  structure  in 
which  there  is  a  relative  clause,  and  a  pronoun  substituted 
for  "program."  Thus,  the  deep  structure  analysis  of  this 
sentence  provides  the  semantic  analysis  portion  of  a  question- 
answering  system  with  a  form  in  which  each  relation  in  the 
sentence  Is  explicit. 
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SECTION  VI 
CONCLUSION 


We  have  been  discussing  the  problems  involved  in  building 
a  computer-based  question -answering  system  which  can  con¬ 
verse  with  a  user  in  natural  language.  Work  has  really  just 
begun  on  these  problems.  There  is  not  yet  even  a  reasonably 
complete  transformational  grammar  of  English.  Even  for  the 
sub -grammars  that  exist,  there  are  no  very  efficient  rarsers. 
Parsing  programs  tend  to  give  many  alternative  syntactic 
analyses,  and  we  are  only  starting  to  learn  how  to  utilize 
information  in  our  semantic  store  to  disambiguate  sentences. 
Some  semantic  deductive  systems  have  been  built  ,  but  none 
has  yet  teen  able  to  work  in  a  large  general  data  base.  The 
feedback  paths  among  question  answerer,  semantic  analyzer, 
parser  and  the  user  have  not  really  been  exploited. 

This  paper  has  given  a  superficial  overview  of  the  difficulties 
facing  the  designers  of  computer  systems  which  can  communicate 
in  English.  Progress  in  this  work  will  make  the  computer  more 
accessible  to  more  people  for  more  :blems,  and  lead  to  a 
deeper  understanding  of  natural  language  and  the  communication 
process. 


The  most  interesting  of  these  are  described  In  references 
3,  7,  9,  10  and  12.  Simmonsll  gives  a  complete  descriptive 
survey  of  question -answering  systems  built  before  1965. 
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